A genetic circuit for epigenetic memory 

Georg Fritz 1 ' 2 , Nicolas E. Buchler 3 , Terence Hwa 4 , and Ulrich Gerland 1,5 
1 Department of Physics and CeNS, LMU Miinchen, 
Theresienstrasse 37, 80333 Miinchen, Germany 
', 2 Albert-Ludwigs University Freiburg, 

O ■ 

O ' Hermann- Herder- Strasse 3, 79104 Germany 

(N : 

£H ■ 3 Center for Studies in Physics and Biology, 



in 



o 



The Rockefeller University, New York, NY 10021 
4 Center for Theoretical Biological Physics and Physics Department, 
University of California at San Diego, La Jolla, CA 92093-0374 an d 
5 Institute for Theoretical Physics, Universitdt zu Koln, 
Ziilpicher Str. 77, 50937 Koln, Germany 

Abstract 

Background: The ability to learn and respond to recurrent events depends on the capacity 
to remember specific, transient biological signals received in the past. It is not known to which 
extent microorganisms can manipulate molecular networks in cells to implement such a memory. 
A synthetic biology approach could demonstrate the capacity of microorganisms to do so. 

Methodology /Principal Findings: Here, we propose a genetic circuit in bacteria that is capable 



o 
o 



of conditionally memorizing a signal in the form of a transcription factor concentration. The circuit 
behaves similarly to a data latch in an electronic circuit, i.e. it reads and stores an input signal only 



> 



when conditioned to do so by a read signal. The core circuit is based on the genetic toggle switch 
consisting of two mutually repressing genes, complemented with a "regulatory front end" which 
involves protein heterodimerization as a simple way to implement conditional control. Deterministic 
and stochastic analysis of the circuit dynamics indicate that an experimental implementation is 
feasible based on well-characterized genes and proteins. 

Conclusion/Significance: Our genetic circuit implements a sequential logic element, the D- 
latch, which is central to advanced signal processing, e.g. genetic counters, and finite-state comput- 
ing machines. Our results suggest that sequential logic elements may be naturally implemented by 
cells through the combination of protein-protein interactions and simple transcriptional regulation. 
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INTRODUCTION 



Biological information is primarily stored in the nucleotide sequence of the genomic DNA. 
However, cells also use other, epigenetic forms of storage such as DNA methylation patterns, 
histone modifications, and gene expression states [l[ [4] . The information contained in these 
forms of biochemical memory is selectively modified by signals during the life cycle of a 
cell, yet is stable so that cells can pass on this memory to their daughter cells for many 
generations. For instance, epigenetic memory plays an important role in development, where 
cell differentiation signals lead to chemical modification of chromatin j^]. Many signals in 
development are transient and the decision to remember or ignore these events can be 
conditioned upon the presence of hierarchical signals. Here, we consider a genetic circuit, 
which implements a conditional memory in its simplest form: it will either store or ignore 
the signal, depending on the state of a distinct "read signal". In this conditional memory 
circuit, the storage of memory is implemented via protein concentrations analogous to genetic 
switches in temperate phages Q]. 

Our starting point is the synthetic toggle switch of Gardner et al. j^J, which exhibits 
bistability. It consists of two genes, A and B, which mutually repress each other so that 
they can be in two stable expression states with either A high, B low or A low, B high, see 
Fig. [0(a). The toggle switch is coupled to two inducers, which set the switch to the ON-state 
(A high) or reset it to the OFF-state (A low), see Fig. QJb) and (c). The memory of this 
toggle switch is unconditional, in that it always stores the state set by the latest inducer 
exposure. To obtain a conditional memory, we introduce a "regulatory front end" to the 
toggle switch of Fig. QJa), through the combinatorial control by two other proteins, R and 
S, see Fig. QJd). The protein R can form homodimers R2 and repress the transcription of 
gene A. It can also bind to S to form heterodimers RS which repress the transcription of 
gene B. Qualitatively, we expect that if R is absent, neither R 2 nor RS can form: in this 
state, the existing memory of the circuit is maintained regardless of the state of the level 
of S. When a significant amount of R is present, it will mostly form homodimers R2 at low 
concentrations of S, so that gene A is repressed and the switch is forced into the OFF state. 
Conversely, when S is highly expressed, mostly the heterodimers RS will form and force the 
switch into the ON state, see Fig. [Tfe) and (f). Hence, at a high level of R, the state of 
gene A should reflect the state of S. In the language of electrical circuits, a high level of R 
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is the command (or condition) for the system to "read" the input signal (S), which is then 
"memorized" when R is subsequently set to a low level. 

Indeed, Hollis et al. [61] characterized a pair of proteins, the 434 repressor and its mutant 
A3AR[a3(P22R)], which behaves like R and S described above. The concept of a regulatory 
front end has been experimentally proven by Kobayashi et al. [7|], who interfaced several 
'sensor modules' to the toggle switch. Incidentally, both the basic toggle switch and the 
conditional memory circuit are examples of what in digital circuit theory is called sequential 
logic jlOj. In contrast to combinatorial logic, the output of which is only a function of its 
present input signals, sequential logic has an output that depends also on the history of 
the input signals. In digital circuits, sequential logic is used to construct delay and storage 
elements, and so-called finite state machines. Indeed, a typical digital circuit consists of both 
combinatorial and sequential logic. Analogously, one might expect that "computation" in 
natural genetic networks is also implemented by motifs of combinatorial and sequential logic 
elements. We have previously studied aspects of combinatorial logic in transcription control 



11 



12j. The present study is an attempt to characterize similar aspects of sequential logic 



in gene regulation, which may be used in natural or synthetic genetic circuits. In particular, 
we use the conditional memory circuit to illustrate that the combination of protein-protein 
interactions and czs-regulatory protein-DNA interactions can supply a particularly simple 
and compact way to implement complex sequential logic, without the need for additional 
genetic circuitry. 

The paper is organized as follows. We first verify the basic functionality outlined above 
within a simple and transparent gene expression model, and characterize the working princi- 
ple of the circuit from a dynamical systems perspective. Within a more detailed, stochastic 
model, we then study the effects of biochemical noise on the functionality of the circuit 
and discuss the experimental feasibility. For the toggle switch of Gardner et al. [5|, the 
sole effect of stochasticity is the spontaneous flipping of the switch js, 9]. We will see that 
the conditional memory circuit has an additional source for noise-induced errors during the 
time periods where the read signal is presented. Finally, in the discussion section, we assess 
the function of the conditional memory circuit both from a biological and a digital circuit 
perspective. 
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RESULTS 



Illustration of the circuit function with a reduced model 

The working principle of the conditional memory circuit of Fig. QJd) is best understood 
within a reduced deterministic description, which considers explicitly only the time evolution 
of the total concentrations of the proteins A and B. Such a description assumes that all 
biochemical processes which do not change the total concentrations A tot and -Btot are so 
rapid that they remain equilibrated at almost all times. The net change of A tot and B tot 
due to protein synthesis and degradation then follows rate equations of the form 



nA;ot = ®aPa(B 2 , R 2 ) — \A tot 



d 
Jt 

j t B tot = a B P B (A 2 , RS) - \B tot . (1) 

Here, we have assumed that protein degradation occurs at the constant rate X p . In contrast, 
the synthesis of proteins A and B is regulated. Their maximal synthesis rates are denoted 
by a a and a B , while the form of the regulatory control is described by the promoter activity 
functions Pa(B 2 ,R 2 ) and Pb{A2, RS). The promoter activity is the fraction of time the 
promoter is not blocked by a repressor and thereby free to bind RNA polymerase (Pa and 
Pb take on dimensionless values between and 1) [3, [ijj]. The arguments of Pa and 
Pb are the concentrations of the dimeric repressors which down-regulate the transcription. 



Within the thermodynamic model for transcription regulation 

Hal 

14j | , the promoter activity 

function Pa takes the form 

Pa(B 2 , R 2 ) = (l + J±-\ 2 (l + -^-) 1 (2) 



K 0b ' v K 0r 
and similarly, 



K OA ) V k 0rs 



Here, the K's denote the equilibrium dissociation constants in vivo for the dimer- 
operator interaction. To achieve the cooperativity required for the bistability of the toggle 



switch 151 ]; we have assumed two binding sites each for the repressor dimers A 2 and B 2 , 
which is reflected in the square of the first factor [13I, [jjj]. Note that the dimer concentrations 
A 2 and B 2 in Eqs. (T5]) and ([3]) must be expressed in terms of the total protein concentrations 
A to t and B tot to close the rate equations ([!]), see Supplementary Material for the explicit 



expressions. Similarly, the concentrations of the control proteins, R2 and RS, are functions 
of the total protein concentrations R to t and Stat, which we use to quantify the strengths of 
the read and input signals 1 . 

Without read signal (Rtot — 0), the second factor on the right hand side of (|2J) and ([2]) 
disappears and the conditional memory circuit behaves like the regular toggle switch 5j. 
The toggle switch can show three different behaviors depending on the ratio of the maximal 
promoter activities a a and «g: The switch is bistable only when a a and as are similar 
and both are sufficiently strong; otherwise it is monostable, either always ON (A high, B 
low) or always OFF (A low, B high), see Fig. [2(a). In the conditional memory circuit, the 
maximal promoter activities effectively get replaced by a a = «a/(1 + R2/K 0r2 ) and «b = 
+ RS/ Kq rs )- Hence, variation in the concentrations of the control proteins R and S 
effectively change the maximal promoter activities, and thus can be interpreted as regulated 
shifts within the state space of the toggle switch. Without read signal, the conditional 
memory circuit is in the bistable regime and remembers its state. When it receives a read 
signal, but no S signal, it moves into the monostable low-A regime as indicated by the green 
arrow in Fig. [2] (a). In contrast, when it receives an R and S signal together, it moves into 
the monostable high-A regime as indicated by the red arrow. The mechanism underlying 
these operations is illustrated in Figs. [2] (b-d), which represent the dynamic properties of 
the circuit at the three indicated points in the state diagram Fig. [2] (a): At any given time, 
the state of the two-gene circuit is specified by the two concentrations A tot and B tot and its 
dynamics is therefore represented by trajectories in the A-B-plane. These trajectories tend 
to the stable fixpoints shown as filled circles in Figs. [2] (b-d). Within the bistable regime, 
there are two stable fixpoints and the state space is divided by a separatrix into two basins 
of attraction for these fixpoints (the empty circle indicates the unstable fixpoint). When a 
read signal is given, this separatrix tilts either towards the A-axis or the B-axis, depending 
on the signal S. Thereby, the basin of attraction of one of the fixpoints is eliminated, so that 
the circuit is ultimately forced into the remaining fixpoint. The upward/downward tilting 
of the separatrix is the physical working principle underlying the two functional operations 

1 For simplicity, we do not include the possible homodimerization of S in our model (434 repressor and 
its mutant 434i?[a3(P22i?)] can both homodimerize [6(). Allowing S2 dimer formation only reduces the 
effective concentration of S available for heterodimerization and can be compensated for by increasing the 
expression rate of S. 
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of Fig. UJf). This simple picture holds only within the reduced model ((Tj), but we will see 
that the qualitative behavior survives in a more realistic quantitative description. 

Switchable hysteresis 

A key property of the conditional memory circuit is to be sensitive to the input S, if 
the read signal R is high, and insensitive to S when R is low. With our circuit design, this 
property results from a hysteresis effect, which exists at low R, but is switched off when 
R is raised. Fig. [3](a) demonstrates the switchable hysteresis in a plot of the steady-state 
concentration of gene A against S to t- At a low R level (blue curve), gene A has two steady- 
state values, one low and one high, over most of the plotted range of S (bistable regime). 
When S is raised from a low value, gene A remains in the lower steady state and switches 
to the higher steady state only at the upper end of the bistable regime. Conversely, if S 
is lowered from maximal expression, A remains in the higher steady state until the lower 
end of the bistable regime is reached. The red curve shows the behavior at high R: Over 
the entire interval, gene A has a unique steady state, and the steady-state concentration 
increases monotonically with S. Hence, the state of gene A follows that of the signal S. 

The switchable hysteresis of the conditional memory circuit is similar to that of magnetic 
memory, even though the underlying physics of a magnet is fundamentally different from 
that of a genetic circuit. It is instructive to compare their behavior: In Fig. [3](b) we show 
a plot similar to Fig. [3](a), but for a single-domain magnet, and the table in Fig. [3](c) 
summarizes the corresponding variables of the two systems. Both systems have an internal 
state that is linked to an external condition, and the link depends on a control variable. 
For the conditional memory circuit the control variable is the level of R, whereas it is the 
temperature for the magnet: above the critical temperature T c (paramagnetic phase), the 
magnetization directly follows the external magnetic field, while below T c (ferromagnetic 
phase), the magnet displays the well-known hysteresis where the magnetization remembers 
its state: upon application of an external field in the opposite direction, it inverts its 
direction only when the field has at least a certain minimal strength. Clearly, the 
conditional memory circuit displays essentially the same behavior as magnetic memory, 
even though it is a dissipative dynamical system and does not have a well-defined Gibbs 
free energy. Here, we compare only the bulk behavior of the magnet with the deterministic 
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behavior of the conditional memory circuit. However, on a qualitative level, the simi- 
larity extends beyond this "mean-field" level: fluctuations in the biochemical reactions, 
which are prominent at low concentrations, lead to spontaneous switching between the 
two stable states in the bistable regime, and similarly fluctuations in the orientation of 
single spins can induce a spontaneous flip in the magnetization for small systems of few spins. 



Circuit function with realistic parameters and in the presence of noise 

So far we have illustrated the functionality of the conditional memory circuit only 
schematically. We will now scrutinize the circuit design more closely within a detailed 
quantitative model, see Fig. H] and 'Materials and Methods. In particular, we are interested 
in the timescales of the circuit dynamics given realistic parameters, and in the reliability 
of the circuit function given the noise in the involved biochemical reactions. For instance, 
how long does a read signal have to be presented in order to ascertain that the circuit will 
have stored the signal? As a realistic example we have in mind an implementation of the 
conditional memory circuit using TetR as protein A, LacR as protein B, the bacteriophage 
434 repressor as protein R, and the mutated 434 repressor of Ref. [6] as protein S. We use the 
corresponding experimental parameters when known and typical values in all other cases, 
see Tables SI and S2 in Supplementary Material. 



Switching dynamics 

To perform its intended function, the conditional memory circuit must respond to time- 
dependent input signals in the way illustrated in Fig. H]T). To verify this response within 
the quantitative model, we prepare the circuit in the ON state (A high) and then impose 
time-dependent transcription rates v mR {t) for R and v ms (t) for S with shapes as shown in 
Fig. EJ^a) . The protocol of Fig. [5](a) tests the complete set of fundamental circuit operations 
in the order: (i) remember S high, (ii) read S low, (iii) remember S low, and (iv) read S high. 
Fig. E3^b) shows the response for both genes, A and B, within the deterministic description 
(dark green and red curve) as well as exemplary trajectories from a stochastic simulation 
(light green and orange curve, respectively). Qualitatively, the curves exhibit the desired 
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behavior: From t = — 150 min, when both R and S are in the low state (basal transcription 
rate of 0.01/min), gene A remains in the ON state. Shortly after the transcription of R is 
turned on at t = 150 min, the memory switches to the state of S, i.e. the OFF state. When 
the transcription of R is stopped at t — 210 min, gene A remains OFF, even after S is 
switched to the ON state at t — 250 min. The change in the state of S affects gene A only 
after the transcription of R is turned on again at t = 300 min. 

From Fig. G^b), we can read off the characteristic time required to flip the switch: The 
duration from the onset of the read signal to the point where gene A and B have reached 
the same expression level is about 35 min when the switch is flipped from ON to OFF and 
« 1 hour in the reverse direction. Since these timescales are relatively long compared to 
all biochemical timescales in our model, it is natural to ask what mechanism sets them. 
We found that the flipping timescale increases approximataly linearly with the protein 
half-life (data not shown). Indeed, the fundamental timescale for all changes in protein 
concentrations is set by the degradation rate, i.e. the half-live of 5 minutes in the present 
case. For the switch to flip, the concentration of the dominant repressor has to drop below 
the apparent threshold for the binding to its operator sites 2 , which amounts to more than a 
50-fold change, given the apparent binding thresholds of about 4 nM and 3 nM for the A 2 
and _E?2 operator sites, respectively. Hence, a minimum of 6 protein half-lives is required to 
flip the switch. The asymmetry between the two switching directions is mainly caused by 
the asymmetry of the regulatory front end as will become clear below. 

Noise-induced errors 

The stochastic trajectory in Fig. [5]^b) displays significant fluctuations, but behaves other- 
wise similar to the noise-less deterministic curve. While this is the case for most stochastic 
trajectories, some deviate qualitatively from the intended response. To examine these noise- 
induced errors, we performed 50000 simulation runs for the same input protocol of Fig. E(a) 
and determined the probability densities p(A tot , t) and p(B tot , t) of the total protein concen- 

2 We define the apparent binding threshold as the concentration of transcription factor, that is needed to 
reduce the promoter activity to 50% of its maximal value. It is not neccessarily equal to the equilibrium 
dissociation constant K, but rather depends on the explicit expression of the promoter activity function. 
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trations. The time-evolution of these densities is shown in Figs. 0(c) and (d), respectively. 
Noise-induced errors are clearly visible, for instance in the time interval from to 150 min, 
where a small part of the density of A tot "leaks" into the low state, while some of the B tot 
density leaks into the high state. Since the read signal is absent in this time interval, the 
leakage corresponds to noise-induced spontaneous flipping of the conditional memory circuit 
with a concomitant loss of the stored information. The same effect occurs in the standard 



genetic toggle switch, for which it has been thoroughly characterized s|, 0, In our case, 
the average lifetime of the ON state is 32 hours and that of the OFF state is around 600 
hours, i.e. both lifetimes are long compared to the timescale for controlled switching. How- 
ever, the new aspect of the conditional memory circuit is that biochemical noise leads to two 
additional types of noise- induced errors: During a read pulse (high R), the switch may not 
flip even though it is triggered to do so (false negative), or the switch may flip, even though 
it was already in the correct state (false positive). 

False negative errors are visible in Figs. [5](c) and (d) at t « 400 min right after the second 
read pulse, where a certain fraction of the A tot density remains in the low state, while the 
same fraction of the B tot density erroneously ends up in the high state. In contrast, for the 
inverse switching direction (after the first read pulse), we observe hardly any false negatives. 
Fig. [6] shows the fraction of false negatives as a function of the read pulse duration for both 
switching directions 3 . We observe that the error fraction decreases rapidly with increasing 
read pulse duration. For long read pulses, it drops below 0.5 % when switching OFF and 
below 4 % when switching ON. Note however that the timescales of Fig. [6] (a) and (b) differ 
by a factor of 3. This asymmetry is equally reflected in the deterministic 'toggle times' 
(solid lines in Fig. [6]), i.e. the minimal read pulse duration required to drive the circuit into 
the basin of attraction of the switched state in the absence of noise effects. The more than 
3-fold difference in the toggle times cannot be explained by the small difference between 
the apparent binding thresholds of the A2 and B2 operator sites (see above). Instead, the 
primary cause is the intrinsically asymmetric design of the regulatory front end: to turn ON 
the switch, both S and R proteins are required, which not only form the RS heterodimers, but 
also a noticeable amount of homodimers R2. These lead to an unwanted partial suppression 



3 We allow for a relaxation time of 60 min after the end of the read pulse and then determine the error 
fraction. Since the rate of spontaneous flipping is very low, the result depends only very weakly on the 
precise value of the relaxation time, provided it is not too short. 
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of gene A, which should be turned ON. Therefore, the protein synthesis of A is reduced, so 
that it takes a longer time to reach the threshold concentration required to flip the switch 4 . 
In contrast, when the switch is turned OFF, the level of S is low and the R proteins form 
almost exclusively homodimers, so that there is practically no repression of gene B. 

In order to quantify the rate of false positives, we prepared the circuit in the ON state, 
turned the signal S on, and then applied a read pulse to test for false positives in the ON 
state, and similarly for those in the OFF state, see Fig. S4 in Supplementary Material. In 
the latter case, the fraction of false positives is very small at < 0.1 %, while it is more 
significant in the ON state at < 4 % (see Fig. S5 in Supplementary Material), again due to 
the asymmetry of the regulatory front end. 



Adaptation of the circuit to different input signal levels 

Above, we varied the duration of the read signal, but assumed a given fixed set of concen- 
trations for the high and low levels of the R and S proteins. However, when the conditional 
memory circuit is embedded into the cellular environment, it must be adjustable to work 
with a variety of input signals, the level of which will depend on the specific context: in 
one situation an S concentration of 50 molecules per cell might correspond to the ON state 
of a signaling process, while in another situation this could be the basal level in the OFF 
state. For a given set of circuit parameters, there exists a certain threshold concentration 
(or "set point") for S, below which the memory flips to the OFF state and above which it 
flips to the ON state when a read signal is given. Similarly, there is a set point for the read 
signal, above which the circuit reads the input and below which it ignores the input. For 
the circuit design to be versatile, these set points must be programmable, so that they can 
be adjusted to lie between the typical high and low levels of S and R, respectively. For our 
circuit design this can be achieved by exploiting the programmability of operator binding 



affinities through simple changes in their nucleotide sequence [ll|: the response of the toggle 
switch of Fig. H] (top) to the regulatory front end (bottom) critically depends on the binding 
thresholds of the R2 and RS binding sites. Fig. S6 in the Supplementary Material shows 
that variation of these binding thresholds allows to adjust the set points for S and R over 



4 Note that while the time to reach the steady state concentration only depends on the degradation rate, 
the time to reach a certain threshold concentration also depends on the synthesis rate. 
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1-2 orders of magnitude. Thus, we expect that the circuit can easily be adapted to work 
under a wide range of conditions. 



DISCUSSION 

Circuit function from a biological perspective 

In this paper, we described the design of a simple genetic circuit to implement conditional 
memory in bacteria, and characterized the circuit dynamics theoretically. The additional 
layer of control (compared to the basic toggle switch) dictates the condition (level of R) 
by which transient information (level of S) may be stored in the toggle switch. Conditional 
memory would then enable cells to manipulate information "collected" under different condi- 
tions at different times. Such capabilities can provide selective advantages to microorganisms 



in time varying environments 17j. For instance, under repeated cycles of famine and feast, 



bacteria which can remember certain environmental traits during feast may formulate better 



survival strategies at the time of famine. 



may be controlled by_a "sensor module 
density of bacteria 



n this functional context, the transcription of S 



,1 



responding e.g. to the light intensity [18] or 
19j , while the transcription of R may be driven by metabolic or growth 
regulators that signal the internal state of the cell. More generally, it has been shown 
on theoretical grounds by Kussell and Leibler that cell populations can benefit from 
stochastic phenotypic switching with memory, where the individuals remember the last few 
phenotypic switches that occurred in their ancestral history. This benefit arises when the 
fluctuations of the environment exhibit longer correlations. The implementation of stochas- 
tic phenotypic switching with memory would require advanced signal processing capabilities 
for which the functionality of the conditional memory circuit is fundamental (see also the 
discussion further below). 

Crucial "device properties" of conditional memory include a rapid timescale of active 
switching, a slow spontaneous loss of the memory content, and a versatile interface to the 
input signals. According to our design and analysis, we expect the circuit to be able to 
respond rapidly to variations in input signals, on a time scale as short as 30 min (if active 



protein degradation is used |20j). Our analysis also suggests that a broad parameter regime 



can be found for which the memory is stable to stochastic fluctuations in gene expression 
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for many cell generations. Furthermore, we showed that the circuit can easily be adjusted 
to function with a wide range of input signal amplitudes (e.g., the transcription rates for R 
and S), and therefore could be employed as a functional module in many different contexts. 

It is presently not known whether microorganisms do in fact use conditional memory. Our 
study indicates that they can implement conditional memory with readily available compo- 
nents. We hope this work will stimulate the construction and experimental characterization 
of the conditional memory circuit. We feel that its implementation would be a milestone in 
synthetic biology, which through its forward engineering approach complements the reverse 
engineering efforts in microbiology. 



Sequential logic in gene regulation 

An important motivation for our study was to explore the design of sequential logic in 
gene regulation. Our case study of the conditional memory circuit permits some general 
conclusions, which are best appreciated by recalling the basic hierarchy of digital logic 
elements 10j. The minimal elements of digital logic are combinatorial logic gates, which 
assign an output value to two (or more) inputs according to a fixed rule (e.g. AND, OR, 
XOR, etc). Most tasks in digital electronics require sequential logic elements, which yield 
an output level that depends not only on the input levels, but also on a stored level. The 
most basic sequential logic element is an uncontrolled RS-latch (or flip-flop), a circuit with 
two states, '0' and '1', which can be affected at any instant of time by two separate inputs, 
which either set it to state 1 or reset it to state 0. Such uncontrolled (or 'asynchronous') 
elements are rarely used in circuits, as they tend to produce unstable circuit behavior. 
Thus, sequential logic elements are made sensitive to a control (or 'enable') signal, typically 
a clock, which determines whether or not the element responds to the inputs. However, even 
with the control, the RS-latch has an undesirable ambiguous input condition when both the 
set and reset signal is presented. In practice, data latches ('D-latches'), which are free of 
ambiguous input conditions, are used instead. All of the sequential logic elements in digital 
electronics are constructed by cascading several combinatorial logic gates and introducing 
feedback paths. 

In gene regulation, combinatorial logic can be flexibly implemented by cis-regulatory 



transcription control 12 ] . The genetic toggle switch of Gardner et al. [5] and its extensions 
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by Kobayashi et al. jj] implement an uncontrolled RS-latch with several variations in the 
type of the input signals. In contrast, our conditional memory circuit implements a data 
latch. In principle, data latches could be constructed by the same strategy as in digital 
electronic circuits, i.e. by cascading multiple transcription factor genes under combinatorial 
transcription control and introducing feedback. However, such designs would require many 
genes and result in slow operation. In this manuscript, we have studied an alternative 
strategy: The conditional memory circuit is a very compact design, where the properties of 
typical protein regulators and transcriptional control mechanisms in bacteria are exploited to 
implement the functionality of a data latch. The central idea behind this seemingly 'natural' 
strategy is to synergistically combine simple protein-protein and protein-DNA interactions 
to achieve complex function. We performed our analysis for parameters associated with a 
specific choice for the R/S pair, the 434 repressor and its mutant 434R[a3(P22R)], because 
their properties had already been characterized quantitatively [6]. However, we believe such 
pairs may be readily generated by synthetic design or natural evolution: As demonstrated 
by Dmitrova et al. 21], not only can DNA binding domains of the transcription factors 
be altered to enable different binding specificities, the dimerization domain can also be 
manipulated to enable the desired homo- and hetero-dimerization required by our design for 
the conditional memory circuit. 



MATERIALS AND METHODS 



Our detailed model for the conditional memory circuit describes the explicit dynamics 
of all biochemical processes shown in Fig. HI The top part of Fig. H] depicts those reactions 
which are already involved in the toggle switch, whereas the bottom part shows the additional 
reactions for the conditional memory. Transcription from promoter A and promoter B is 
regulated by independent 5 binding of the dimeric repressors A 2 , B 2 , R2, and RS to their 
respective operator sites. We explicitly describe the dynamics of the repressor-operator 
interactions to take into account the effect of operator state fluctuations [8]. Transcription 
occurs only when no repressor is bound to the promoter. In this state, mRNA molecules 
are produced at the rates v mA and v mB , respectively (we will assume strong promoters with 

5 We assume no direct interaction between the DNA-bound repressors. This assumption is conservative, 
since cooperative interactions would only help to make the bistability of the circuit more pronounced. 
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v m A = *Vb = 5 min -1 ). The mRNAs are translated at a rate and degraded at a rate A m 
(we fix the average mRNA half-life to a typical value of 3 min 22| and assume v v « 2.3 min -1 
to obtain on average 10 proteins per mRNA molecule, again a typical value 23|, |24|, |25| ) . 
The rate for protein turnover, A p , is an important parameter, which sets the timescale of the 
circuit behavior (see below). A rapid response of the memory can be obtained only when 
degradation is rapid, a constraint that is well known for the toggle switch [5(. We assume 
that all proteins are actively degraded with half-lives of 5 minutes 6 , which is usually achieved 



in synthetic gene circuit experiments by SsrA tags [27|, |28|, |29|]. Our rate constants for the 
association of monomers and dissociation of dimers are listed in Supplementary Material, as 
are the rates for the binding and unbinding of the dimers to their operators 7 . 

For the control proteins R and S, we describe the expression dynamics in the same 
way as for A and B, however we consider their transcription rates u mR (t) and v ms (t) as 
time-dependent input signals for the genetic circuit 8 . For instance, the protein R could be 
coupled to the circadian rhythm, with a periodic transcription rate u mR (t) caused by other 
regulatory processes in the cell. And the transcription of protein S could be regulated by 
a signal transduction pathway that is sensitive to a time- dependent external signal. Our 
aim is to characterize the response of our genetic circuit model to different forms of the 
input signals v mR (t) and u ms (t). To this end, we solve the deterministic rate equations 
for all reactions by numerical integration, and also perform stochastic simulations of the 



same reactions using the algorithm of Gillespie 30|. The dynamics of our detailed model 
simplifies to that of the reduced model (0Q) in the limit where (i) all protein concentrations 
are high, (ii) the dimerization and DNA binding reactions are rapid, and (iii) the mRNA 
concentrations equilibrate much faster than the protein concentrations. In this limit, the 



variables v4 to t and B tot are the only relevant slow degrees of freedom [3J|. However, in our 
case these conditions are not met, since the proteins with SsrA tags are degraded almost as 
rapidly as the mRNA. Hence, we explore the behavior of the full model. 



6 We assume the same degradation rate for protein monomers and dimers, i.e. no cooperative stability 26|. 

7 In our model, dimerization occurs always prior to operator binding. This pathway is consistent with 
typical parameter values for bacterial transcription factors. 

8 Note that we could equally well assume that the transcription of R, S is constant, but their dimerization or 
their DNA-binding activity is time-dependent, e.g. due to regulation by ligand binding or phosphorylation. 
However, none of our conclusions are sensitive to the detailed mechanism that in one way or another 
controls the total concentrations of active R and S proteins. 



14 



ACKNOWLEDGMENTS 



We thank W. Hillen and G. Koudelka for providing useful information on TetR and 434 
repressor. 

Author contributions. GF, NB, TH, and UG designed the research. GF performed the 
simulations. GF, NB, TH, and UG wrote the paper. 

Funding. NB acknowledges a Burroughs- Wellcome Career Award, TH acknowledges 
support by the NSF through Grant No. MCB-0417721 and PFC-sponsored Center for 
Theoretical Biological Physics (Grants No. PHY-02 16576 and PHY-0225630), and UG ac- 
knowledges an Emmy Noether grant of the Deutsche Forschungsgemeinschaft. 

Competing Interests. The authors have declared that no competing interests exist. 



[1] Jaenisch, R k Bird, A. (2003) Nat. Genet. 33, 245-254. 
[2] Casadesus, J k D'Ari, R. (2002) Bioessays 24, 512-518. 
[3] Arney, K k Fisher, A. (2004) J. Cell. Sci. 117, 4355-4363. 

[4] Trusina, A, Sneppen, K, Dodd, I, Shearwin, K, k Egan, J. (2005) PLoS Comput. Biol. 1, e74 
[5] Gardner, T. S, Cantor, C. R, k Collins, J. J. (2000) Nature 403, 339-342. 
[6] Hollis, M, Valenzuela, D, Pioli, D, Wharton, R, k Ptashne, M. (1988) Proc. Natl. Acad. Sci. 
USA 85, 5834-5838. 

[7] Kobayashi, H, Kaern, M, Araki, M, Chung, K, Gardner, T. S, Cantor, C. R, k Collins, J. J. 

(2004) Proc. Natl. Acad. Sci. USA 101, 8414-8419. 

[8] Kepler, T. B & Elston, T. C. (2001) Biophys. J. 81, 3116-3136. 

[9] Aurell, E k Sneppen, K. (2002) Phys. Rev. Lett. 88, 048101. 
[10] Katz, R k Boriello, G. (2004) Contemporary Logic Design (Prentice Hall). 
[11] Gerland, U, Moroz, J, k Hwa, T. (2002) Proc. Natl. Acad. Sci. USA 99, 12015-12020. 
[12] Buchler, N. E, Gerland, U, k Hwa, T. (2003) Proc. Natl. Acad. Sci. USA 100, 5136-5141. 
[13] Bintu, L., Buchler, N.E., Garcia, H.G., Gerland, U., Hwa, T., Kondev, J., k Phillips, R. 

(2005) Curr. Opin. Genet. Dev. 15, 116-124. 

[14] Bintu, L., Buchler, N.E., Garcia, H.G., Gerland, U., Hwa, T., Kondev, J., Kuhlman, T., k 
Phillips, R. (2005) Curr. Opin. Genet. Dev. 15, 125-135. 



15 



[15] Cherry, J. L & Adler, F. R. (2000) J. Theor. Biol. 203, 117-133. 

[16] Allen, R. J, Warren, P. B, & ten Wolde, R R. (2005) Phys. Rev. Lett. 94, 018104. 

[17] Kussell, E. & Leibler, S. (2005) Science 309, 2075-2078. 

[18] Levskaya, A., Chevalier, A., Tabor, J., Simpson, Z., Lavery, L., Levy, M., Davidson, E., 

Scouras, A., Ellington, A., & Marcotte, E. (2005) Nature 438, 441-442. 
[19] Bassler, B. L. (1999) Curr. Opin. Microbiol. 2, 582-587. 
[20] Karzai, A, Roche, E, & Sauer, R. (2000) Nat. Struct. Biol. 7, 449-455. 

[21] Dmitrova, M., Younes-Cauet, C, Oertel-Buchheit, P., Porte, D., Schnarr, M. & Granger- 

Schnarr, M. (1998) Mol. Gen. Genet. 257, 205-212. 
[22] Bernstein, J. A, Khodursky, A. B, Lin, P.-H, Lin-Chao, S, k Cohen, S. N. (2002) Proc. Natl. 

Acad. Sci. USA 99, 9697-9702. 
[23] Arkin, A, Ross, J, & McAdams, H. H. (1998) Genetics 149, 1633-1648. 
[24] Cai, L, Friedman, N, & Xie, X. S. (2006) Nature 440, 358-362. 
[25] Yu, J, Xiao, J, Ren, X, Lao, K, & Xie, X. S. (2006) Science 311, 1600-1603. 
[26] Buchler, N. E, Gerland, U, & Hwa, T. (2005) Proc. Natl. Acad. Sci. USA 102, 9559-9564. 
[27] Elowitz, M. B & Leibler, S. (2000) Nature 403, 335-338. 

[28] Basu, S, Gerchman, Y, Collins, C. H, Arnold, F. H, & Weiss, R. (2005) Nature 434, 1130-1134. 
[29] Fung, E, Wong, W. W, Suen, J. K, Bulter, T, Lee, S.-g, k Liao, J. C. (2005) Nature 435, 
118-122. 

[30] Gillespie, D. (1977) J. Phys. Chem. 81, 2340-2361. 

[31] Bundschuh, R, Hayot, F, & Jayaprakash, C. (2003) Biophys. J. 84, 1606-1615. 



16 



(a) inducer i, Toggle Switch (d) Conditional memory circuit 



T 

Inducer I2 



(b) 







l 1= HI 


(e) 


S=LO 


S=HI 


l 2 LO 


keep state 


A OFF & BON 


R LO 


keep state 


keep state 


l 2 =HI 


AON SB OFF 


A ON & B ON 
(forbidden) 


R=HI 


A OFF & B ON 


A ON & B OFF 



Toggle ON 



Toggle OFF 

_0_ 



(f) 

i Remembers i Remembers 
Rt — ( S = HI ) R "—i (S = LO) 



— J (S = HI) " —J 



a. 1 



FIG. 1: Design and Function of the Toggle Switch Compared to the Conditional Memory Circuit. 
In the toggle switch (a) of Gardner et al. [5] the bistable circuit of two mutually repressing genes 
A, B is controlled by two inducers I±, I2, effectively implementing the rules (b). The diagrams (c) 
illustrate how the switch is "set" to the ON state, i.e. high (HI) expression of A, and "reset" to 
the OFF state, i.e. low (LO) expression of A, by pulses of I\ and I2, respectively. In contrast, the 
conditional memory circuit (d) is regulated by the transcription factors R and S. They form hetero- 
and homodimers RS and R2 repressing the transcription of gene A and B, respectively. Effectively, 
the circuit remembers the expression state of S during the last pulse of R. Hence, R functions as a 
read signal for the information contained in S, as illustrated in the table (e) and the diagrams (f). 
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(a) 



FIG. 2: Working Principle of the Conditional Memory Circuit. 

(a) When the concentrations of the control proteins R and S change, the circuit moves in the state 
diagram of the toggle switch, which displays either bistable or monostable behavior depending 
on the (effective) promoter strengths of gene A and B. (b-d) The control proteins R and S tilt 
the separatrix, which separates the basins of attraction of the two stable fixpoints (filled circles). 
When the circuit reaches the borderline to a monostable regime in (a), one of the stable fixpoints 
"annihilates" with the unstable fixpoint (empty circle). 
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(a) conditional memory circuit (b) spin system (magnet) 
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FIG. 3: Switchable Hysteresis and Analogy to Single-Domain Magnet. 

(a) Hysteresis in the conditional memory circuit, as seen in the dependence of the steady-state 
concentration of gene A on the level of S. At a low R level (R < R c , blue curve), gene A has 
two steady-state values over most of the plotted range of S (bistable regime). At a high R level 
(R > R c , red curve), the bistable regime disappears, and the state of gene A always reflects that of 
the signal S. See Fig. S3 in Supplementary Material for the corresponding contour lines in the state 
diagram, (b) The analogous diagram for a single-domain magnet, (c) Corresponding variables of 
the conditional memory circuit and the magnet. 
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FIG. 4: Illustration of the Quantitative Model. 

The model comprises the processes of transcription, translation, dimerization, operator binding 
and degradation of mRNA and proteins. All reactions are shown together with their associated 
reaction rate or, in the case of reversible reactions like dimerization or protein-DNA binding, their 
respective equilibrium constant (our results are based on simulations of the full dynamics). 
The upper part depicts the toggle switch module, in which the dimeric proteins of one species can 
bind to the promoter region of the other one. As soon as one of the two operator sites in either 
of the promoter regions is occupied, downstream transcription is inhibited. Although we do not 
consider cooperative interactions of adjacently bound transcription factors, the integration of two 
binding sites for both A and B is essential for the emergence of bistability. 

The lower part shows the regulatory front end dictating the state of the toggle switch via two 
additional binding sites for R2 and RS downstream of the transcriptional start sites of genes A and 
B, respectively. The two inputs to the circuit are the transcription rates v mR and v ms of R and S. 
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FIG. 5: Response Characteristics of the Conditional Memory Circuit. 

The time-dependent response of the circuit to the input signals (a) is shown in (b-d). The dark 
red and dark green line in (b) show the total concentrations Atot and -Btot calculated from the 
deterministic rate equations, whereas the light red and green lines are obtained from a single 
stochastic simulation run. The time evolution of the probability densities for A tot and -Btot are 
shown in (c) and (d), respectively (the densities are obtained from 50000 stochastic simulation 
runs; the color codes for the observed number of trajectories inside each bin). 
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FIG. 6: Switching Errors. 

The fraction of false negative errors as a function of the read pulse duration for (a) switching into 
the OFF state and (b) into the ON state. The circles indicate data from stochastic simulations 
(the interpolating dashed lines are guides to the eye) and the solid lines indicate the minimum read 
pulse duration for switching in the absence of noise (deterministic description). 
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